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Abstract. The structure representation of data distribution plays an 
important role in understanding the underlying mechanism of generat- 
ing data. In this paper, we propose nearest prime simplicial complex ap- 
proaches (NSC) by utilizing persistent homology to capture such struc- 
tures. Assuming that each class is represented with a prime simplicial 
complex, we classify unlabeled samples based on the nearest projection 
distances from the samples to the simplicial complexes. We also extend 
the extrapolation ability of these complexes with a projection constraint 
term. Experiments in simulated and practical datasets indicate that com- 
pared with several published algorithms, the proposed NSC approaches 
achieve promising performance without losing the structure representa- 
tion. 

Keywords: Topology; Persistent Homology; Object Recognition; Supervised 
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1 Introduction 

The structure representation is important to understanding the underlying mech- 
anism of generating data. To capture such structures, manifold learning algo- 
rithms 1 1121 assume that data are generated from an underlying low-dimensional 
manifold. However, it is not easy to discover and preserve the topological struc- 
ture hidden in the manifold. For example, [3] pointed out that the view and 
style-independent action manifolds, which are used to describe human activi- 
ties, can be assumed to lie in a torus. 

Persistent homology can effectively discover the topological invariants such as 
holes, which cannot be easily available by other means such as manifold learning 
algorithms (4] . The method first incrementally constructs nested families of sim- 
plicial complexes from point cloud data (PCD), and then computes the lifecycle 
of each possible topological invariant by placing the complexes within an evolu- 
tionary growth process. Finally, it extracts those truly topological invariants or 
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features with longer lifecycle and removes topological noises [5]. However, how to 
employ it for practical applications (e.g., object recognition) remains unsolvable. 

In this paper, we propose a novel method, called nearest prime simplicial 
complex approaches (NSC), to obtain a structure-preserving representation and 
achieve higher performance in object recognition. Specifically, we generate a 
nested family of simplicial complexes per class, and estimate a prime simpli- 
cial complex per class by weighting the lifecycles of alive topological structures. 
Then we classify objects based on the nearest projection distances from each 
object to simplices in these simplicial complexes. Furthermore, we also utilize 
a projection constraint term to enhance the extrapolation ability of NSC and 
prevent incorrect projection. The main contribution is that we extend the geo- 
metrical framework of simplicial complex to object recognition. Specifically, we 
propose NSC approaches to object recognition and show how to use them for 
point classification. Experiments in several simulated and practical datasets show 
that without losing the structure representation, the proposed NSC approaches 
attain promising performance compared with several well-known algorithms. 

The remainder of this paper is organized as follows. In Section [2j we will 
introduce some preliminary of simplicial complex and give a brief survey on 
persistent homology. In Section [3] we will detail our proposed NSC algorithm. 
We evaluate the performance of the proposed NSC approaches in Section |4| We 
conclude the paper in Section [5] 

2 Preliminary and Related work 

In this section, we will introduce some preliminary of simplicial complex and 
the construction of simplicial complex, and survey the development of persistent 
homology. 

2.1 Preliminary 

The simplicial complex is a collection of simplices subject to some rules. The 
simplex and simplicial complex are defined as follows: 

Definition 1: Let {vq, v\, ■ ■ ■ , v p } be a geometrically independent set in M. N . 
We define the p-simplex a spanned by v§,v\, ■ ■ ■ , v p to be the set of all points x 
of such that [6j: x = X)f=o where X)f=o ^» = 1 an( ^ ^> ^ ^ 0- 

In general, each p-simplex a has p + 1 faces which are (p — l)-simplices. The 
face is obtained by deleting one of the vertices v ,vi, • • • ,v p . 

Definition 2: A simplicial complex K in R N is a collection of simplices in 
M. N such that [5J: 1) Every face of a simplex of K is in K. 2) The intersection of 
any two simplices of K is a face of each of them. 

An illustration on the distinction between simplicial complex and non-simplicial 
complex is shown in Fig. [T] Obviously, the non-simplicial complex violates the 
second rule. 

In this paper, we mainly utilize Lazywitness complexes, which behaves like 
Delaunay triangulations computed in the intrinsic geometry of the data set X, 
to construct the simplicial complex for PCD. 



NSC for Object Recognition 3 



Fig. 1. The distinction between non-simplicial complex (left) and simplicial complex 
(right). 

Specifically, we select a subset Z = {vi, ■ ■ ■ , v p } C X as the vertex set by us- 
ing sampling techniques such as max-min sampling or random sampling methods 
at first. The max-min sampling method randomly extracts one point as the first 
vertex, and iteratively selects the next n — 1 points that maximize the minimal 
distances between them and the previous vertices. With this way, the method 
generates a vertex set uniformly distributed around the data structure [3]. Then 
wc utilize the remaining points as the witness points set {101,102, • • • ,w q } to 
determine which simplices occur in the complex ^ . 

More formally, let D be a p x q distance matrix, where q denotes the num- 
ber of the remaining points. Each element D(i,j) measures the distance be- 
tween landmark point i and witness point j. To discover a persistent topologi- 
cal invariant from PCD, we construct a nested family of simiplicial complexes 
W(D;R, /). Here R is the radius of metric ball, and / is a non-negative in- 
teger. If /=0, we define rrij — 0, Wj = 1,2, •• • ,q. Otherwise, let rrij be the 
/-th smallest entry of the j-th column of matrix D. Then wc utilize two rules 
to determine which simplices can be added into the complex [2]: 1) a 1-simplex 
a = [v a Vb\ will be added to W(D; R, /) iff there exists a witness Wj (1 < j ^ q) 
satisfying m.ax(D(a, j),D(b,j)) < R + rrij. 2) a fc-simplex [v ao v ai ■ ■ ■ v ak ] will 
be added to W(D; R, f) iff there exists a witness Wj (1 ^ j ^ q) satisfying 
max(D(a J),D(a 1 ,j),--- ,D(a k ,j)) < R + rrij. 

Note that when the number of training samples is small, which is very com- 
mon in object recognition domain, we can instead use the Rips complex method 
to obtain the nested families of simplicial complexes. Assuming that each point 
is a center of a closed Euclidean ball with radius R, Rips method iteratively 
builds a complex by forming a line in any two points if the balls of them are 
intersected [1]. 

2.2 Related work 

Persistent homology is to discover some stable topological invariants from PCD. 
To achieve the goal, there are three crucial steps [5 : 1) selecting a subset that 
expresses the non-trivial topological attributes measured by homology groups, 2) 
measuring the importance of these subsets and 3) eliminating those topological 
attributes with the minimum number of side-effects. 
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[4)7j investigated the influence of sampling technique to the estimation of 
topological invariants. With persistent homology and sampling strategy, |4l7j 
discovered that image patches with edges lie in a Klein-bottle-shape space. [5] 
proposed to use geodesic Delaunay triangulation to reduce the number of sam- 
ples, which is required to capture the topology of PCD. 

To discover the topological structure from data cloud points, it is neces- 
sary to construct the simplicial complexes. There are several different complexes 
including Cech, Rips, Explicit, Witness and Lazywitness complexes in litera- 
tures. Let PCD be X, and the radius of PCD be R. Specifically, the Cech com- 
plex Cech(X, R) means the nerve of the collection of metric balls {B(xj), R/2}, 
Xj € X, j = 0, 1, • • • ,p [5], with vertex set X. [5] proposed a-complexes, and 
estimated the invariants through computing the persistent Betti numbers. For 
saving storage space, Rips(X, R) only stores the edges and vertices, and forms 
the largest simplicial complex that has the same 1-skeleton (i.e. vertices and 
edges) as Cech(X, R). However, both of the two methods produce a very large 
amount of complexes, especially for large-scale PCD. To refine the efficiency, [I] 
proposed witness complex by selecting a group of landmark points and utilizing 
the remaining points as witness of the existence of simplicial complexes. 

[lOj employed Barcode technique to measure the importance of topologi- 
cal attributes. Furthermore, [H] applied the persistent homology to extract 
some topological features from character-shape point cloud data. [H] studied 
the smallest coverage issue in sensor networks based on the persistent homology. 
Assuming that stratified spaces consist of multiple manifolds or non-manifolds, 
each of which has varying dimension, |13j generalized the computation of persis- 
tent homology to that of intersection homology for better analyzing the stratified 
spaces. Moreover, [T3] clustered data points into different stratified space using 
methods derived from kernel and cokernel persistent homology. [15] investigated 
the persistent homology of random fields and manifold learning. A major diffi- 
culty is that it is not easy to fill the gap between the persistent homology and 
practical applications. 

3 Nearest Prime Simplicial Complex 

In this section, we will detail the NSC approaches by dividing them into two 
parts. 

3.1 Selecting prime simplicial complexes 

To utilize the persistent homology for recognition, we propose three crucial steps 
including eliminating the redundant simplices, recording a recognition-related 
Barcode and selecting the prime simplicial complexes. 

With the methods mentioned above, specifically, we can construct a filtered 
simplicial complex from the point cloud data by increasing R from to oo. The 
filtered complex is an increasing sequence of simplicial complexes which deter- 
mine an inductive system of homology groups [10] . To our research, we discover 
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that in this sequence, a proper complex, named prime simplicial complex, is use- 
ful for recognition. The prime simplicial complex is a relatively stable complex 
from which we can capture the homology of the data's topological structure. For 
better understanding, an example is shown in Fig. [2] 




Fig. 2. We can construct a simplicial complex through metric balls with a radius R. A 
good choice of R (left) induces a prime simplicial complex which can help us to capture 
the homology of an annulus from the union of balls. Meanwhile, the union of balls with 
incorrect radius will induce an incorrect structure representation (middle and right). 



A fc-simplex which is not a face of any k + 1-simplices of the same complex 
is a relatively highest-dimensional simplex compared with its lower-dimensional 
ones. Note that we always focus on the relatively highest-dimensional simplices 
of the prime simplical complex since their faces which are lower-dimensional 
ones have been implicitly considered in our NSC approaches. For avoiding the 
repeated computation, we propose to remove these faces when constructing the 
prime simplicial complex. Here we give a pseudo-code based on Lazywitness 
complexes in Algorithm [T] Note that in line 3, the matrix E is calculated as: 

E(i,j) = minmax(D(i, k),D*(k, j))) - m k (1) 

k 

and in line 9, the lower-dimensional simplices will be removed after the merging 
procedure is completed. 

Once the prime simplicial complex is constructed, we use Barcode technique 
to record the lifetime of each simplex belonging to the complex as the parameter 
R increases until -R max is reached. We only consider the simplices which are still 
alive when R = R max . An example is shown in Fig. |3][^j 

Obviously, it is hard to find the best prime simplicial complex from the 
sequence. Therefore, we propose to select a radius R* based on the weighted 
lifecycles: 

V™ I M- 

H - sr m a ^ > 

3 Note that our Barcode is different from that in [15j . The reason is that although |15|'s 
Barcode technique is a good way to describe the persistent homology by recording 
the birth and death time of some topological invariants, only the alive simplices are 
useful for our proposed NSC algorithm. 
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Algorithm 1 Construct the Prime Simplicial Complex using Lazywitness Com- 
plexes 

input Point Data P, R, the ratio r, the family / 

output the vertices of each simplex constructing the simplicial complex 
1: Choose p landmark points and q witness points, where p — size(P)/(r + 1) and 
q — p ■ r. 

2: Compute the p x q matrix D of distances. 

3: Compute the pxp matrix E with off-diagonal entries E(i, j) = R[ ViV .j which record 

the time when edge ViVj appears. 
4: Consider every two pairs (i, j) where i < j ^ p 
5: if E(i,j) < R then 
6: Add [ViVj] to W(D; R, /). 
7: Remove [vt], [vj] from W(D; R, f) 
8: end if 

9: Generate higher-dimensional cells inductively: the fc-simplex [v ao v ai ■ ■ ■ v ak ] occurs 
iff the three lower-dimensional simplices [v ai ■ ■ ■ v ak ], [v ao ■ • ■ v ak —i] and [wa Va fc ] all 
occur. 



where m is the number of simplices, £j is the length of the i-th barcode, and 
Mi is the radius corresponding to the median of ti. Intuitively, the shorter the 
lifecycle, the more unstable the corresponding simplex, and the less influence it 
raises to the determination of a stable and prime simplicial complex. Formally, 
let the length of the shorter lifecycles be £a,i (i = 1, ■ ■ • , s) and the others be 
(■B,j (j = 1) • " • j s ') with s + s' — m, then we can rewrite Eq. ^ as: 



EU *A,i + E ■=! *B d EU + E^! ?B j 



(3) 



When for all the lifecycles, we have Ia,{ <C ^B,j, then Eq. (|3| can be approxi- 
mated by: 

R* » ,J J (4) 

It indicates that the primal simplicial complex is less sensitive to those simplicial 
complexes with the shorter lifecycles. It is also noting that we construct a prime 
simplicial complex per class for classification. 



3.2 Classifying objects based on NSC 



Assuming that data distribution per class is represented by a prime simplicial 
complex, we attempt to classify unlabelled samples by projecting the samples 
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Fig. 3. We construct a simplicial complex (left) for a circle-shape data. In the panel, red 
dotted line and blue line denote 1-simplex and 2-simplex, respectively. Each barcode 
(right) of its simplices starts at a specific R value, and ends up at i? max which is used 
to determine when to stop the computation of barcode. In this figure, i? max is set to 
1. Those disappeared simplices haven't been shown in the figure. 



to the simplices of prime simplicial complexes. With this way, we can avoid 
projecting them to some holes and voids that may exist in the structures. The 
holes and voids will lead to incorrect projection and impair the classification 
performance. 

Specifically, let <j,- (i = 1, 2, • • ■ , m) be a fc-simplex with vertices {vq, v\, ■ ■ ■ , Vk}. 
Then the projection position x p of sample x can be defined as a linear combina- 
tion of vertices in the simplex: 

k k 
x p = AjWj, where Aj = 1 (5) 

t=0 i = 

Here A.; is the weight value. Take a 2-simplex as an example. The weight is equal 
to 

A *-\l-Ao-A l5 z = 2 (6) 

where B — [vq — vi , v\ — V2] ■ As for a 1-simplex, the weight is equal to: 

{ (x-v 1 ) T (v 1 -v ) . _ n 
{v^vor^-vo) 1 - U (7) 
1 - A i = l 

If the projection index < Aj < 1, the projection position locates inside the face. 
Otherwise, it locates outside the face. For A, > 1 or A, < 0, on one hand, it can 
lead to an incorrect projection for distant points. On the other hand, it provides 
a "forward" and "backward" interpolation along a face when the number of 
training sample is small. To make a compromise between preventing that data 
are incorrectly projected outside the face and preserving the extrapolation ability 
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of topological structure, we introduce a parameter 7 to compute the projection 
position and corresponding projection distance as follows: 

_ j Vi + (l + i)(vj-Vi), if 
p ~\«i-7(wi-«i), if Ai<- 7 lJ 

where Vj , Vj denote two different vertices of a simplex. Then the distance between 
a sample x and a simplicial complex of the c-th class is: 

d NS c(x\SC c ) = min(x - x e p c ) T A{x - x e p c ), 

£ = l,--- ,m;c= 1,--. ,C (9) 

where to denotes the number of simplices in the complex, C is the number of 
classes, and A is a non-negative matrix]^] Finally, we classify sample to a class 
that has the nearest simplicial complex distance to the sample: 

C(x) = axgmind N sc(x\SC c ) c =!,••• ,C (10) 



4 Experiments 

Experiments are performed to evaluate the performance of the NSC approach. 
Here, two face recognition datasets and eight UCI benchmark data sets [16] are 
used as listed in Tab. [TJ 

We also use five simulated datasets and four practical multi-view datasets. 
The five simulated datasets are generated from different topological structures 
plus random noise with variance p. They are 1) Dl: two concentric circles 
(p = 1.0); 2) D2: two spirals (p = 3.5); 3) D3: circle-cross-circle (p = 2.0); 
4) D4: four circle-cross-circle (p = 2.0) and 5) D5: Sphere-cross-sphere (p = 1.5) 
datasets as shown in Fig. [4j Each dataset includes 2-class, each of which has 
500 training samples and 500 test samples without overlap. We use max-min 
sampling strategy to select 50% training samples as the landmark points [4 and 
the remaining samples as the witness points to construct the prime simplicial 
complexes. Some examples of these complexes are illustrated in Fig. [4j From the 
figures we can see that the NSC approaches preserve the structure representation 
well. 

The four practical multi-view data sets used for object recognition are COIL- 
20 Q7], COIL-100 [H], SOIL-47A and SOIL-47B [Tg. The COIL-20 dataset 
consists of 20 objects, each of which has 72 different views that are sampled 
every 5° around an axis passing through the object. Each object is an image with 
size 128 x 128. We subsample them to 32 x 32 ones. The COIL-100 dataset has 

4 It can be obtained by metric learning which goes beyond the scope of this paper. 
In our paper, we set it to be an identity matrix or inverse covariance matrix. The 
former one is equivalent to a Euclidean distance. Meanwhile, the latter one leads to 
a classical Mahanalobis distance, named NSC-M. 
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Table 1. Description of several benchmark data sets. Here , "Dim" denote the num- 
ber of samples and means dimension, respectively. 'C denotes the number of classes, 
and 'RA' denotes the ratio of the number of training samples in each dataset or the 
number of training samples versus that of test samples. The latter one means that 
training set and test set have been separated by their provider. 



Datasets 


# 


Dim 


C 


RA 


ORL 


400 


10304 


40 


0.5 


UMIST 


575 


10304 


20 


0.5 


Iris 


150 


4 


3 


0.5 


Landsat Satellite 


6335 


36 


2 


0.1 


Image Segmentation 


2310 


16 


7 


210/2100 


Gaussian Elena 


5000 


8 


2 


0.5 


Breast Cancer Wisconsin 


569 


31 


2 


0.5 


Phoneme 


5404 


5 


2 


0.1 


Pendigits 


10992 


17 


10 


7494/3498 


Optdigits 


5620 


65 


10 


3823/1797 



Table 2. Experiment I: The influence of / to the classification performance on the five 
simulated datasets. Experiment II: The influence of i? max and comparison with other 
algorithms. The experiment results are the average of 20 repetitions. In the table, A±B 
means average error rate and standard deviation (%). 





Dl 


D2 


D3 


D4 


D5 


Experiment I: i? max = 0.5,7 = 


NSC (/ = 0) 


4.24 ±0.68 


13.38 ± 1.18 


8.09 ± 1.03 


9.08 ± 1.05 


3.85 ±0.56 


NSC-M (/ = 0) 


4.40 ±0.85 


13.27 ± 1.10 


9.52 ±0.92 


13.29 ± 1.55 


7.62 ± 1.04 


NSC (/ = 1) 


3.81 ±0.59 


11.70 ± 1.09 


6.01 ± 0.74 


6.94 ± 0.82 


2.97 ±0.57 


NSC-M (/ = 1) 


3.83 ±0.59 


11.66 ± 1.13 


6.54 ±0.79 


8.53 ± 1.00 


5.15 ±0.92 


NSC (/ = 2) 


3.87 ±0.81 


10.68 ± 1.05 


6.19 ±0.70 


6.55 ±0.72 


2.89 ±0.73 


NSC-M (/ = 2) 


3.79 ±0.76 


10.63 ± 1.06 


6.69 ±0.63 


7.79 ± 1.05 


4.29 ±0.84 


Experiment II: / = 2, 7 = 


NSC: iCax = 0.5 


3.41 ±0.51 


11.70 ±0.62 


5.77 ±0.42 


6.58 ± 0.57 


2.68 ±0.70 


NSC-M: i? max = 0.5 


3.43 ±0.52 


11.67 ±0.70 


6.23 ±0.49 


8.06 ± 0.76 


4.36 ± 1.01 


NSC: i? max = 1.0 


3.83 ±0.74 


10.60 ±0.76 


6.18 ±0.46 


6.77 ±0.70 


2.42 ±0.67 


NSC-M: i? max = 1.0 


3.78 ±0.74 


10.59 ±0.76 


6.46 ±0.52 


7.60 ±0.89 


3.42 ±0.88 


1-NN 


4.58 ±0.53 


13.24 ±0.80 


7.30 ±0.88 


7.88 ±0.71 


3.13 ±0.86 


3-NN 


4.20 ±0.63 


11.74 ±0.87 


6.65 ±0.76 


7.09 ±0.73 


2.83 ±0.77 


SVM-G 


3.24 ±0.62 


10.23 ±0.87 


5.46 ±0.73 


6.30 ±0.63 


2.35 ±0.52 
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Fig. 4. From left to right, from top to bottom: Dl to D5 datasets. In each panel, red 
dotted line and blue line denote 1-simplex and 2-simplex, respectively. The test sets are 
generated based on the same distribution. Note that the fifth dataset cannot be shown 
correctly in the three-dimensional space since two spheres which cross each other can 
only be seen in four or higher-dimensional space. 



100 objects which is collected with the same way as the COIL-20. We subsample 
each object image to colored 16 x 16 one, short for COIL-100A and gray 32 x 32 
one, short for COIL-100B. The SOIL-47A and SOIL-47B datasets are sampled 
from different illuminations [H]. Each dataset consists of 47 objects, each of 
which has 21 different views that are sampled every 9° around an axis passing 
through the object. Each object image is subsampled to a colored image with 
size 24 x 30. All of these images are directly served as the feature vectors. Some 
objects in the three practical datasets are shown in Fig. [5] 

As to the two face datasets, the UMIST face dataset [20] is a multi-view one 
for testing the robust of our approach, and the ORL dataset [2T] is also another 
popular benchmark one for face recognition. It is worth mentioning that our 
object recognition is in an instance level, i.e, all the data points in a data set 
belongs to the same category, and is not in the sense of the VOC (visual object 
classes) challenges. 

For comparison, we also compare the performance of our approaches with 
1-nearest neighbor (1-NN), 3-NN and SVM with Gaussian RBF kernels (SVM- 
G) (52] . The parameters in SVM are tuned by cross-validation. The whole train- 
ing set is used by these approaches. 
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Fig. 5. From Left to Right: examples of COIL-20 [T7], COIL-100 [E] and SOIL-47 [19] 
benchmark datasets. 



4.1 Simulated datasets and parameter influences 

We investigate the influence of / in the five datasets. Given / = 0, 1, 2, i? max = 
0.5 and 7 = 0, the average results of 20 repetitions are shown in Tab. [2] From 
the Tab. [2] we can see that the performance of the proposed approaches with 
/ = 2 is better in most cases. A possible reason is that as [4] pointed out, 
/ = 2 provides a clean persistent interval graph with little "noise". Therefore, 
it leads to a more stable structure representation. Note that in practical noisy 
environments, such graphs cannot be easily obtained. Meanwhile, f — 1 can be 
interpreted as arising from a family of coverings of the space X with Voronoi-like 
regions surrounding each landmark point. We thus set the parameter / to be 1 
or 2 in the subsequent experiments. Note that in "small training samples" case 
shown in the next subsection, we use Rips complex, which needn't the parameter 
/, for object recognition. Furthermore, we found that Mahalanobis distance is 
helpful to improve the performance of the proposed algorithms in some cases. 

We also study the influence of the parameter i? max in determining the optimal 
value R* , which is closely related to the selection of prime simplicial complex 
per class. We perform experiments on the five simulated datasets by selecting 
a group of i? max , followed by computing the corresponding R* . The results are 
shown in Tab. [2] and Fig. [6} From the results we can see that when i? max locates 
in an interval [0.3, 1], the classification performance are better. A reason is that 
the radius of these simulated datasets is close to 0.5. As a result, the topological 
structure can be preserved well when i? max is selected around 0.5. 

Furthermore, we compare the NSC approaches with 1-NN, 3-NN and SVM 
methods in the five 2-class datasets. The reported results are shown in Tab. [2j 
It can be seen from Tab. [2] that in these five datasets, the NSC approaches are 
always better than 1-NN and 3-NN, and achieve competitive performance com- 
pared with SVM-G. It is worth noting that as illustrated in Fig.|4j our approaches 
also preserve reasonable structure representations to these data distributions. 
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Two Concetric Circles [f=2] 




Fig. 6. Parameter Influence on the Dl simulated dataset. 



4.2 Small training samples and high-dimensional datasets 



We test the proposed approaches on four multi-view object recognition datasets, 
each of which can be regarded as generating from a circle-shape structure. We 
use four different views per class (i.e., 0°, 90°, 180° and 270°) and eight views 
per class as the training set, respectively. The remaining images are used as 
the test set. Since the number of training samples is small, we set 7 to be 1 to 
enhance the extrapolation ability of NSC, and employ Rips method [4] instead 
for the construction of prime simplicial complexes based on the whole training 
set. Here R is set to be 30. Note that R in Rips method is different from that in 
Lazywitness method. The results are shown in Tab. [3] From the results we can 
see that compared with NN and SVM algorithms, the proposed NSC approaches 
achieve the best performance in 4 out of 5 datasets. In SOIL-47B dataset, the 
performance of NSC is slightly worse than those of SVM algorithms. It indicates 
that the proposed NSC approaches can work well in high-dimensional multi-view 
structures. Note that here we haven't reported the results of NSC-M approach 
since the computation of covariance matrix is ill-posed when the number of 
samples is less than the dimension of a data set. Furthermore, we also observe 
that with 8 views as the training samples, our approach obtains the competitive 
performance as those state-of-art algorithms using 4 views in COIL and SOIL 
datasets 23J. However, the latter ones utilize very effective feature extraction and 
image registration techniques. In contrary, our approaches achieve a good trade- 
off between recognition accuracy and topology preservation by only introducing 
additional 4 views. 
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Table 3. The error rates and standard deviations (%) of several approaches in the 12 
practical datasets. Here '4V and '8V denote 4 and 8 views, respectively. 





NSC 


NSC-M 


1-NN 


3-NN 


SVM-G 


COIL-100A (4V) 


12.84 


N/A 


16.85 


28.40 


14.23 


COIL-100A(8V) 


2.81 


N/A 


5.33 


13.36 


4.06 


COIL-100B (4V) 


24.01 


N/A 


29.50 


43.66 


24.23 


COIL-100B (8V) 


7.50 


N/A 


12.78 


27.67 


8.36 


SOIL-47A (4V) 


16.67 


N/A 


19.12 


56.99 


21.08 


SOIL-47A (8V) 


11.61 


N/A 


13.54 


26.49 


12.65 


SOIL-47B (4V) 


22.92 


N/A 


23.41 


40.93 


22.67 


SOIL-47B (8V) 


15.33 


N/A 


18.30 


25.60 


14.58 


COIL-20 (4V) 


15.00 


N/A 


16.76 


28.24 


17.36 


COIL-20 (8V) 


2.97 


N/A 


5.39 


12.27 


4.85 


ORL 


7.02 ± 2.00 


N/A 


8.78 ± 2.62 


17.23 ± 2.5 


6.38 ± 2.01 


UMIST 


3.66 ± 1.52 


N/A 


5.34 ± 1.52 


11.05 ± 2.36 


6.43 ± 1.31 


Iris 


5.27 ± 2.10 


4.00 ± 2.10 


7.40 ± 1.81 


5.93 ± 2.10 


4.53 ± 1.80 


Landsat Satellite 


13.49 ± 0.44 


20.40 ± 0.61 


13.54 ± 0.38 


13.58 ± 0.55 


11.89 ± 0.54 


Image Segmentation 


6.38 


45.90 


7.10 


10.62 


6.05 


Gaussian-clcna 


15.24 ± 0.58 


34.61 ± 1.23 


20.15 ± 0.70 


18.52 ± 0.55 


9.98 ± 0.51 


Breast Cancer Wisconsin 


3.77 ± 0.95 


10.63 ± 2.03 


5.12 ± 1.32 


4.18 ± 0.83 


3.18 ± 1.12 


Phoneme 


19.91 ± 0.61 


21.85 ± 0.66 


16.13 ± 0.45 


16.57 ± 0.54 


15.40 ± 0.78 


Pcndigits 


2.20 


4.20 


2.57 


2.43 


1.83 


Optdigits 


3.06 


3.95 


3.45 


3.28 


1.56 



4.3 Face recognition 

We compare our approaches with others in ORL [3T] and UMIST [2U] face recog- 
nition datasets. In ORL dataset, the images of each subject are taken at dif- 
ferent times with various lighting, facial expressions and facial details [21]. In 
UMIST dataset, the images of each subject are taken by varying angles from 
left profile to right profile. We employ PCA to reduce the original dimensions 
to 40-dimensional subspaces since empirically, the subspaces preserve most of 
the principal structures. Furthermore, we also employ Rips method to construct 
the prime simplicial complexes based on the whole training set. The results are 
shown in Tab. [3] It can be seen from the results that NSC approach obtains the 
best performance in UMIST data set and ranks 2 in ORL data set. 



4.4 UCI datasets 

Finally, we evaluate the performance of the NSC approaches in 8 UCI datasets. 
Different from the aforementioned datasets, these datasets are taken from re- 
markably different domains. The results are shown in Tab. [3] We can see from 
the Tab. [3] that the proposed NSC approaches achieve competitive performance 
in these datasets. NSC ranks 1 in 2 of 8 datasets and ranks 2 in 5 of 8 datasets. 
It means that although devoting to preserve structures, the proposed NSC ap- 
proaches can also be applied to some general fields. 
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4.5 Discussion 

Here we perform a significant analysis to the proposed NSC approaches based 
on the results shown in Tab. [3| With the significance level of 5%, the p-value of 
the paired t-test results for the NSC approaches in the 20 data sets are shown 
in Tab. [4] It indicates that NSC, 1-NN and SVM-G are statistically similar in 
these datasets. 

Table 4. The p-value of the paired t-test results based on Tab. [3] The p-value in 
bold type indicates a rejection of the null hypothesis at the 5% significance level, 
which means there is significant difference between the two approaches. 





NSC vs 1-NN 


NSC vs 3-NN 


NSC vs SVM-G 


p-value 


0.4014 


0.0101 


0.8163 



We also want to discuss some limitations of the proposed approaches. Al- 
though our goal is to preserve the topological structure of datasets, first of all, 
the current persistent homology techniques can only provide some approxima- 
tions to the 'truly' topological invariants, as our approaches do. It is also unclear 
that whether the topological structures indeed exist in the high-dimensional data 
sets. Secondly, the evaluation is to a certain extend unfair to our approaches since 
other approaches use the whole training sets to train their models, whereas due 
to the nature of witness complex, we have to select at most use 50% of the 
training sets as the landmark points to build our classification model for large- 
scale training samples. Thirdly, the computational complexity is higher. Given 
the dimension is d, and the number of data set is n, specifically, the computa- 
tional complexity of Rips complexes is 0(d ■ n 2 ), and that of witness complexes 
is 0{r ■ d ■ n 2 ), where r is the ratio of the number of landmark points to n. 
Furthermore, the computational complexity of computing nearest distance from 
data point to the prime simplicial complexes is 0(n 2 ). When data are subject 
to Gaussian distribution, finally, the proposed approaches will lose their merits 
in recognizing objects. 

5 Conclusion 

We propose new structure-preserving NSC approaches by utilizing persistent 
homology technique in this paper. We refine the construction of simplicial com- 
plex by removing some simplices that are redundant to the NSC approaches. 
We present a new Barcode method to determine a prime simplicial complex per 
class for classification. We also propose a nearest projection technique by com- 
puting the distance from unlabelled samples to the prime simplicial complexes. 
Furthermore, we generalize the extrapolation ability of simplicial complexes with 
a projection constraint term. Experiments indicate that compared with several 
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well-known algorithms, our proposed NSC approaches achieve promising perfor- 
mance without losing the preservation of structure representation. 

In this paper, the proposed approaches does not consider how to deal with 
recognizing those faces in the wild. However, our goal is to design a topology- 
preserving classifier for object recognition and supervised learning, and the "face 
in the wild" problem can be avoided by employing Near-Infrared sensor to alle- 
viate the influence of background if we attempt to employ our approach to such 
a scenario. 

In the future, we will investigate how to employ the NSC approaches to other 
practical applications with more complex topological structures. Furthermore, 
how to construct a more suitable prime simplicial complex deserve study. More- 
over, we will study the performance of the proposed NSC approaches for object 
recognition in a category level rather than in an instance level. Finally, we will 
consider to further refine the performance of the NSC approaches by utilizing 
metric learning methods. 
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